reward hacking (1)

jonathanm.bsky.social
Interested in: cognitive science, Bayes, (ir)rationality, (effective) altruism, happiness, Al Alignment, reward hacking, running, reading books. He/him